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TAACAAGTCA TTCAAACACG GCGAAGATGT CTATGCGTAT AACAGAATAT TTGGAGAAAT 120 

GCTCGCAAAA CTCGAAATTG TCACCGATAA AATGATTAAC TTGAAGGGGC TAATGTAAGT 180 

TATCTGATGT TTCTACAATT AAAAAAATTG TTTTTTTTTC CAAATTAATT TTCGAAGATT 240 

AACGAAAAAC GATTAAAAAT CAATAAAACG CAATAAAGAG GGCTTGGCTT TCTTTTTAAT 300 

TTAAATTATA ATTTTTCTGA TTGTTGTATG AAGCTACAAA ATGTACTGTT TTTGTATTTG 360 

AATATTGTAT TACAGGGTTG GGATTCTCGG CAAATATCAG CGACAGTGGA AGATTTAGAA 420 

GAAGGACGTG TGACAATCAC TAAGTCAAAG AGGGAAAGGA TAAAGGATTG TGATATTTCA 480 

CTGTTTTACT CATTCGCTTT TTAAATAAGA ACTATATGCC GATTTGCCGA TATATTTTTG 540 

TTTATTAGGC CTCTCACATT CCTGTACAAT GTTTCTACCA AATAAACTGC ATTTTTATCT 600 

GAAAATTCGA ATTTATTTTT GTCTACTTTT TACTCGTTGC ATTCGAGATC AGCATATCTT 660 

CCGGTCTATT TATATTCAAC GATTTTTATA AATTAGTACT CCTTCATGTT TAATTTCATT 720 

TTATCTGTAA GCTTTACTGT ATTTTTTTAA AATCTTTCTT GCTTCTATCT GATTATACAA 780 

TGTTCTTTAC TCATTTTCAA GGTATTTTTA TGCCTCACAA TTTATGCACA TTTCGGGCTT 840 

GGAGATTTAT CCTCTATATT ACATGCCTGT TTTTTTAAAG GATATAATGT TTAACAAATA 900 

ATTTTTTATC AATGCTATTG TATATTCTCC AGCTAACCGT TGTTTCGAAA ACATCACCTA 960 

GCATTTTAAA ATTCACAAAA TCTTGCTTCC TTATAATCAA GAAGATTTTT CAGATGCTCT 1020 

GCGAAATCGA ATGCCGCGCT TTGAGCACGG CACACACGAG GCTCATCCAC GACTTTGAAC 1080 

CACGTGACGC ATTGACTTAT TTAGAAGGCA AAAACATTTT CACAGAAGAT CATTCTGAAC 1140 

TTATCAGTAA AATGTCAACT CGCCTCGAGA GGATCGCCAA TTTTCTTCGA ATCTATCGAC 1200 

GTCAAGCTTC TGAACTTGGA CCACTCATCG ACTTTTTCAA CTACAACAAT CAAAGTCACC 1260 

TTGCTGATTT CCTCGAAGAC TACATCGATT TTGCGATAAA TGAGCCAGAT CTACTTCGTC 1320 

CAGTAGTGAT TGCTCCACAA TTTTCCCGAC AAATGCTCGA TAGGAAACTA TTGCTTGGGA 1380 

ATGTTCCAAA ACAAATGACA TGCTATATTC GAGAGTATCA CGTGGATCGA GTGATCAAAA 1440 

AGCTCGACGA GATGTGTGAT TTAGGTGAGA AAACTGGAAG CTCTCGTGTT TATTATAATC 1500 

TTGCTTAAAC TTCAGACTCC TTTTTTCTGT TTCTACACGG CCGAGCTGGA TCCGGAAAAT 1560 

CAGTAATTGC ATCACAAGCT CTTTCGAAAT CTGACCAACT TATTGGAATG TGAGTGGTAT 1620 

TATCTGAATC TACGGATCTT CATTCTATTA CAGAAATTAT GATTCAATCG TTTGGCTCAA 1680 

AGATAGTGGA ACAGCTCCAA AATCTACATT CGATTTATTT ACGGATATTT TGCTGATGCT 1740 

AAAGTGAGTG AATAGAGTGC ATGTAACATT CAGCATGATT TTGAAATTAT GAAAATTTGA 1800 

CCTGGTTAGC TTTTAATTTG ATATTTCGTG ACGCTTGCAT GTTTTGTGTG TTTGAAGACG 1860 

AGCCCGTGTT GTGAGCGACA CGGATGACTC GCATTCGATC ACCGACTTCA TTAACCGTGT 1920 

TCTTTCAAGA AGCGAAGACG ATCTTCTCAA TTTCCCATCG GTGGAGCATG TCACGTCAGT 1980 
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TGTACTCAAA AGGATGGTAA GTTGCTTGCC GATTCTGGTA CAATATCTTA AATTATTGGT 2040 

TTTTAGATCT GCAACGCACT CATTGATCGT CCAAATACTT TATTCGTATT TGATGACGTA 2100 

GTTCAAGAAG AAACAATTCG TTGGGCTCAG GAGCTACGTC TTCGATGTCT TGTAACTACT 2160 

CGTGACGTGG AAATATCAAA TGCTGCTTCT CAAACATGCG AATTCATTGA AGTGACATCA 2220 

TTGGAAATCG ATGAATGTTA TGATTTTCTA GAAGCTTATG GAATGCCGAT GCCTGTTGGA 2280 

GAAAAAGAAG AAGATGTGCT TAATAAAACA ATCGAACTAA GCAGTGGAAA TCCAGCAACG 2340 

CTTATGATGT TTTTCAAGTC TTGTGAACCG AAAACATTTG AAAAGTGAGT GGGACATACC 2400 

AATTTGAGAC TTTTAAAATA ATTTATTCTA CAATAAAAGT TAATCAAAAA GTTTCATAGC 2460 

TGATTGTCTT TAAATTTTAC GAATTGAGGA TCAAAATCAA GAATTAGGAT CCTGGCACGA 2520 

GAGAAAACTG TGTAGCTACC GTACCCGAGA GATTTTCTTG ATATTTGCCA TCGATTTAAT 2580 

TTTTTAAGAA AATTATCGTT TTACATAATT GAACAAGAGA TACACGGTCT CGACCCGACG 2640 

GAAATTTTTT AAATGAAAGC GAGTATGAGC CTGTTTTCAT TATTTTTCGA TTTTCTCTTG 2700 

TTGTTTCTTT TTATTTAAAG CCTTTTATTT TGAAACAAGT CTAAAAATAT TAAAAACTGA 2760 

ATAAAATATT TAAAAAAAAT CAAGTAAAAT AGAAAAACAG CAAGGCTGGA GACTACTGTA 2820 

CTTCTTAAAT CCGCATACTC TTTTTATTTA ATCATTTTCC GGAATGTCGA AACGAAATAA 2880 

TACATTTTTA GTCCAAAATC GCTAGGTATA TTCTTAAAAT TATCAAACAT TTTGCATTCA 2940 

GAATGGCACA GCTTAATAAC AAATTGGAAA GTCGAGGATT AGTCGGTGTT GAATGTATCA 3000 

CCCCTTACTC GTACAAGTCA CTCGCAATGG CTCTTCAAAG ATGTGTTGAA GTTTTGTCAG 3060 

ATGAGGATCG AAGTGCTCTT GCTTTCGCAG TTGTGATGCC TCCTGGAGTT GATATACCCG 3120 

TCAAGCTATG GTCATGTGTT ATTCCAGTTG ATATTTGTTC AAATGAAGAA GAACAATTGG 3180 

ATGATGAAGT TGCGGATCGG TTGAAAAGAC TCAGCAAGTA TGAGTCTTGA AATTTGAAGA 3240. 

TTTAAATTAA CACTTAAAAT TTCAGACGTG GAGCTCTTCT CAGTGGAAAA CGAATGCCCG 3300 

TTTTGACATT CAAAATTGAT CATATTATCC ATATGTTCTT GAAACACGTC GTTGATGCAC 3360 

AAACTATCGC CGTATGCTGA AAATGTCTCA ACTTTCAATT AAATTTTAAA TTTTCAGAAT 3420 

GGAATCTCAA TTCTCGAGCA GCGTCTTCTT GAAATAGGAA ACAATAATGT ATCAGTACCG 3480 

GAGCGACATA TACCATCACA TTTCCAAAAA TTCCGTCGTT CATCAGCCAG TGAGATGTAT 3540 

CCAAAAACTA CAGAAGAAAC TGTGATCCGT CCTGAAGACT TCCCAAAGTT CATGCAATTG 3600 

CACCAGAAAT TCTATGACTC CCTCAAAAAT TTTGCATGCT GTTAAAACCT ATCGTGTACA 3660 

ATATTGCCTG TATATTCCCC TCGAAATACG TTTATACTTT TTCGCACGAG TTTTCTCATT 3720 

TTTTCATTTG TACTTGTTTT ATTTCTCTCC AAAATTTCAG ATCTATCCCA AATGTTCTTA 3780 

AATTTAATGT TTTCTACAGA TACTCAACAC ATCTTGTTTC ATCTCATCCT TGCTTTTTTT 3840 

TTTCAAATAT ATTCAGTTTC TTTTATAATT TTAATTAATC GAATTAATAC ATTCACGTAA 3900 
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(2) INFORMATION FOR SEQ ID NO: 2: 











(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 549 amino acids 

(B) TYPE: amino acid 

(C) STRANDED NESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Leu Cys Glu He Glu Cys Arg Ala Leu Ser Thr Ala His Thr Arg 
15 10 15 

Leu He His Asp Phe Glu Pro Arg Asp Ala Leu Thr Tyr Leu Glu Gly 
20 25 30 

Lys Asn He Phe Thr Glu Asp His Ser Glu Leu He Ser Lys Met Ser 
35 40 45 

Thr Arg Leu Glu Arg He Ala Asn Phe Leu Arg He Tyr Arg Arg Gin 
50 55 60 

Ala Ser Glu Leu Gly Pro Leu He Asp Phe Phe Asn Tyr Asn Asn Gin 
65 70 75 80 

Ser His Leu Ala Asp Phe Leu Glu Asp Tyr He Asp Phe Ala He Asn 
85 90 95 

Glu Pro Asp Leu Leu Arg Pro Val Val He Ala Pro Gin Phe Ser Arg 
100 105 HO 

Gin Met Leu Asp Arg Lys Leu Leu Leu Gly Asn Val Pro Lys Gin Met 
115 120 125 

Thr Cys Tyr He Arg Glu Tyr His Val Asp Arg Val He Lys Lys Leu 
130 135 140 

Asp Glu Met Cys Asp Leu Asp Ser Phe Phe Leu Phe Leu His Gly Arg 
145 150 155 160 

Ala Gly Ser Gly Lys Ser Val He Ala Ser Gin Ala Leu Ser Lys Ser 
165 170 175 

Asp Gin Leu He Gly He Asn Tyr Asp Ser He Val Trp Leu Lys Asp 
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Ser Gly Thr Ala Pro Lys Ser Thr Phe Asp Leu Phe Thr Asp lie Leu 
195 200 205 

Leu Met Leu Lys Ser Glu Asp Asp Leu Leu Asn Phe Pro Ser Val Glu 
210 215 220 

His Val Thr Ser Val Val Leu Lys Arg Met He Cys Asn Ala Leu He 
225 230 235 240 

Aso Ara Pro Asn Thr Leu Phe Val Phe Asp Asp Val Val Gin Glu Glu 
. 245 250 255 

Thr He Arg Trp Ala Gin Glu Leu Arg Leu Arg Cys Leu Val Thr Thr 
260 265 270 

Arq Asp Val Glu He Ser Asn Ala Ala Ser Gin Thr Cys Glu Phe He 
275 280 285 

Glu Val Thr Ser Leu Glu He Asp Glu Cys Tyr Asp Phe Leu Glu Ala 
290 295 300 

Tvr Gly Met Pro Met Pro Val Gly Glu Lys Glu Glu Asp Val Leu Asn 
305 310 315 320 

Lys Thr lie Glu Leu Ser Ser Gly Asn Pro Ala Thr Leu Met Met Phe 
325 330 335 

Phe Lys Ser Cys Glu Pro Lys Thr Phe Glu Lys Met Ala Gin Leu Asn 
340 345 350 

Asn Lys Leu Glu Ser Arg Gly Leu Val * Gly Val Glu Cys He Thr Pro 
355 360 365 

Tyr Ser Tyr Lys Ser Leu Ala Met Ala Leu Gin Arg Cys Val Glu Val 
370 375 380 

Leu Ser Asp Glu Asp Arg Ser Ala Leu Ala Phe Ala Val Val Met Pro 
385 390 395 400 

Pro Gly Val Asp He Pro Val Lys Leu Trp Ser Cys Val He Pro Val 
405 410 415 

Asp He Cys Ser Asn Glu Glu Glu Gin Leu Asp Asp Glu Val Ala Asp 
420 425 430 

Arg Leu Lys Arg Leu Ser Lys Arg Gly Ala Leu Leu Ser Gly Lys Arg 
435 440 445 

Met Pro Val Leu Thr Phe Lys He Asp His He He His Met Phe Leu 
450 455 460 

Lvs His Val Val Asp Ala Gin Thr He Ala Asn Gly He Ser He Leu 
465 470 475 480 

Glu Gin Arg Leu Leu Glu He Gly Asn Asn Asn Val Ser Val Pro Glu 
485 490 495 

Arg His He Pro Ser His Phe Gin Lys Phe Arg Arg Ser Ser Ala Ser 
500 505 510 

Glu Met Tyr Pro Lys Thr Thr Glu Glu Thr Val He Arg Pro Glu Asp 
515 520 525 
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Phe Pro Lys Phe Met Gin Leu His Gin Lys Phe Tyr Asp Ser Leu Lys 
530 535 540 

Asn Phe Ala Cys Cys 
545 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

Asp Gin Asp Lys Ser Gly Phe He Glu Glu Asp Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Asp Gin Asp Lys Asp Asp Phe He Gly Glu Asp Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Asp Ser Asp Gly Asp His Lys He Gly Val Asp Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
Asp He Asn Lys Asp Asp Val Val Ser Trp Glu Glu 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

Ala Lys Glu Gly Asp Pro Gin Leu Ser Lys Glu Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: . 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

Asp Lys Asn Gly Asp Gly Glu Cys Ser Phe Glu Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Asp Ala Asp Gly Gly Gly Asp He Ser Val Lys Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
Asp Glu Asp Gly Ser Gly Thr He Asp Phe Glu Glu 
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(2) INFORMATION FOR SEQ ID NO: 11: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: not relevant 

(ii) MOLECULE TYPE: prote N in 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

Asp Arg Asn Ala Asp Gly Tyr He Asp Ala Glu Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Asp Lys Asn Asn Asp Gly Arg He Asp Phe Asp Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

Asp Lys Asp Gly Asn Gly Thr He Thr Thr Lys Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Leu Gly Glu Asp Asn He Asn Val Val Glu Gly Asn Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 



7/ 



(D) TOPOLOGY: linear 

(li) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

Asp Asn Asp Asn Asp Lys Phe Glu Gly Asn Cys Ala Glu 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 

Glv Val Asp Pro Ser Arg Lys Glu Asn His Leu Ser 
15 10 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Asp Leu Asn Lys Asp Gly Gin lie Gin He Glu 
15 10 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7653 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

AGATCTGAAA TAAGGTGATA AATTAATAAA TTAAGTGTAT TTCTGAGGAA ATTTGACTGT 60 

TTTAGCACAA TTAATCTTGT TTCAGAAAAA AAGTCCAGTT TTCTAGATTT TTCCGTCTTA 120 

TTGTCGAATT AATATCCCTA TTATCACTTT TTCATGCTCA TCCTCGAGCG GCACGTCCTC 180 

AAAGAATTGT GAGAGCAAAC GCGCTCCCAT TGACCTCCAC ACTCAGCCGC CAAAACAAAC 240 

GTTCGAACAT TCGTGTGTTG TGCTCCTTTT CCGTTATCTT GCAGTCATCT TTTGTCGTTT 300 

TTTTCTTTGT TCTTTTTGTT GAACGTGTTG CTAAGCAATT ATTACATCAA TTGAAGAAAA 360 

GGCTCGCCGA TTTATTGTTG CCAGAAAGAT TCTGAGATTC TCGAAGTCGA TTTTATAATA 420 
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TTTAACCTTG GTTTTTGCAT TGTTTCGTTT AAAAAAACCA CTGTTTATGT GAAAAACGAT 480 

TAGTTTACTA ATAAAACTAC TTTTAAACCT TTACCTTTAC CTCACCGCTC CGTGTTCATG 540 

GCTCATAGAT TTTCGATACT CAAATCCAAA AATAAATTTA CGAGGGCAAT TAATGTGAAA 600 

CAAAAACAAT CCTAAGATTT CCACATGTTT GACCTCTCCG GCACCTTCTT CCTTAGCCCC 660 

ACCACTCCAT CACCTCTTTG GCGGTGTTCT TCGAAACCCA CTTAGGAAAG CAGTGTGTAT 720 

CTCATTTGGT ATGCTCTTTT CGATTTTATA GCTCTTTGTC GCAATTTCAA TGCTTTAAAC 780 

AATCCAAATC GCATTATATT TGTGCATGGA GGCAAATGAC GGGGTTGGAA TCTTAGATGA 840 

GATCAGGAGC TTTCAGGGTA AACGCCCGGT TCATTTTGTA CCACATTTCA TCATTTTCCT 900 

GTCGTCCTTG GTATCCTCAA CTTGTCCCGG TTTTGTTTTC GGTACACTCT TCCGTGATGC 960 

CACCTGTCTC CGTCTCAATT ATCGTTTAGA AATGTGAACT GTCCAGATGG GTGACTCATA 1020 

TTGCTGCTGC TACAATCCAC TTTCTTTTCT CATCGGCAGT CTTACGAGCC CATCATAAAC 1080 

TTTTTTTTCC GCGAAATTTG CAATAAACCG GCCAAAAACT TTCTCCAAAT TGTTACGCAA 1140 

TATATACAAT CCATAAGAAT ATCTTCTCAA TGTTTATGAT TTCTTCGCAG CACTTTCTCT 1200 

TCGTGTGCTA ACATCTTATT TTTATAATAT TTCCGCTAAA ATTCCGATTT TTGAGTATTA 1260 

ATTTATCGTA AAATTATCAT AATAGCACCG AAAACTACTA AAAATGGTAA AAGCTCCTTT 1320 

TAAATCGGCT CGACATTATC GTATTAAGGA ATCACAAAAT TCTGAGAATG CGTACTGCGC 1380 

AACATATTTG ACGGCAAAAT ATCTCGTAGC GAAAACTACA GTAATTCTTT AAATGACTAC 1440 

TGTAGCGCTT GTGTCGATTT ACGGGCTCAA TTTTTGAAAA TAATTTTTTT TTTCGAATTT 1500 

TGATAACCCG TAAATCGTCA CAACGCTACA GTAGTCATTT AAAGGATTAC TGTAGTTCTA 1560 

GCTACGAGAT ATTTTGCGCG CCAAATATGA CTGTAATACG CATTCTCTGA ATTTTGTGTT 1620 

TCCGTAATAA TTTCACAAGA TTTTGGCATT CCACTTTAAA GGCGCACAGG ATTTATTCCA 1680. 

ATGGGTCTCG GCACGCAAAA AGTTTGATAG ACTTTTAAAT TCTCCTTGCA TTTTTAATTC 1740 

AATTACTAAA ATTTTCGTGA ATTTTTCTGT TAAAATTTTT AAAATCAGTT TTCTAATATT 1800 

TTCCAGGCTG ACAAACAGAA ACAAAAACAC AACAAACATT TTAAAAATCA GTTTTCAAAT 1860 

TAAAAATAAC GATTTCTCAT TGAAAATTGT GTTTTATGTT TGCGAAAATA AAAGAGAACT 1920 

GATTCAAAAC AATTTTAACA AAAAAAAACC CCAAAATTCG CCAGAAATCA AGATAAAAAA 1980 

TTCAAGAGGG TCAAAATTTT CCGATTTTAC TGACTTTCAC CTTTTTTTTC GTAGTTCAGT 2040 

GCAGTTGTTG GAGTTTTTGA CGAAAACTAG GAAAAAAATC GATAAAAATT ACTCAAATCG 2100 

AGCTGAATTT TGAGGACAAT GTTTAAAAAA AAACACTATT TTTCCAATAA TTTCACTCAT 2160 

TTTCAGACTA AATCGAAAAT CAAATCGTAC TCTGACTACG GGTCAGTAGA GAGGTCAACC 2220 

ATCAGCCGAA GATGATGCGT CAAGATAGAA GGAGCTTGCT AGAGAGGAAC ATTATGATGT 2280 

TCTCTAGTCA TCTAAAAGTC GATGAAATTC TCGAAGTTCT CATCGCAAAA CAAGTGTTGA 2340 
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ATAGTGATAA TGGAGATATG ATTAATGTGA GTTTTTAATC GAATAATAAT TTTAAAAAAA 2400 

AATTGATAAT ATAAAGAATA TTTTTGCAGT CATGTGGAAC GGTTCGCGAG AAGAGACGGG 2460 

AGATCGTGAA AGCAGTGCAA CGACGGGGAG ATGTGGCGTT CGACGCGTTT TATGATGCTC 2520 

TTCGCTCTAC GGGACACGAA GGACTTGCTG AAGTTCTTGA ACCTCTCGCC AGATCGTAGG 2580 

TTTTTAAAGT TCGGCGCAAA AGCAAGGGTC TCACGGAAAA AAGAGGCGGA TCGTAATTTT 2640 

GCAACCCACC GGCACGGTTT TTTCCTCCGA AAATCGGAAA TTATGCACTT TCCCAAATAT 2700 

TTGAAGTGAA ATATATTTTA TTTACTGAAA GCTCGAGTGA TTATTTATTT TTTAACACTA 2760 

ATTTTCGTGG CGCAAAAGGC CATTTTGTAG ATTTGCCGAA AATACTTGTC ACACACACAC 2820 

ACACACATCT CCTTCAAATA TCCCTTTTTC CAGTGTTGAC TCGAATGCTG TCGAATTCGA 2880 

GTGTCCAATG TCACCGGCAA GCCATCGTCG GAGCCGCGCA TTGAGCCCCG CCGGCTACAC 2940 

TTCACCGACC CGAGTTCACC GTGACAGCGT CTCTTCAGTG TCATCATTCA CTTCTTATCA 3000 

GGATATCTAC TCAAGAGCAA GATCTCGTTC TCGATCGCGT GCACTTCATT CATCGGATCG 3060 

ACACAATTAT TCATCTCCTC CAGTCAACGC ATTTCCCAGC CAACCTTGTA TGTTGATGCG 3120 

AACACTAAAT TCTGAGAATG CGCATTACTC AACATATTTG ACGCGCAAAT ATCTCGTAGC 3180 

GAAAAATACA GTAACCCTTT AAATGACTAT TGTAGTGTCG ATTTACGGGC TCGATTTTCG 3240 

AAACGAATAT ATGCTCGAAT TGTGACAACG AATTTTAATT TGTCATTTTT GTGTTTTCTT 3300 

TTGATATTTT TGATCAATTA ATAAATTATT TCCGTAAACA GACACCAGCG CTACAGTACT 3360 

CTTTTAAAGA GTTACAGTAG TTTTCGCTTC AAGATATTTT GAAAAGAATT TTAAACATTT 3420 

TGAAAAAAAA TCATCTAACA TGTGCCAAAA CGCTTTTTTC AAGTTTCGCA GATTTTTTGA 3480 

TTTTTTTCAT TCAAGATATG CTTATTAACA CATATAATTA TCATTAATGT GAATTTCTTG 3540 

TAGAAATTTT GGGCTTTTCG TTCTAGTATG CTCTACTTTT GAAATTGCTC AACGAAAAAA 3600. 

TCATGTGGTT TGTTCATATG AATGACGAAA AATAGCAATT TTTTATATAT TTTCCCCTAT 3660 

TCATGTTGTG CAGAAAAATA GTAAAAAAGC GCATGCATTT TTCGACATTT TTTACATCGA 3720 

ACGACAGCTC ACTTCACATG CTGAAGACGA GAGACGCGGA GAAATACCAC ACATCTTTCT 3780 

GCGTCTCTCG TCTTCAGCAT GTGAAATGGG ATCTCGGTCG ATGTAAAAAA ATGTCGAATA 3840 

ATGTAAAAAA TGCATGCGTT TTTTTACACT TTTCTGCACA AATGAATAGG GGGAAAATGT 3900 

ATTAAAATAC ATTTTTTGTA TTTTTCAACA TCACATGATT AACCCCATTA TTTTTTCGTT 3960 

GAGCAACTTA AAAAGTAGAG AATATTAGAG CGAAAACCAA AATTTCTTCA AGATATTACC 4020 

TTTATTGATA ATTATAGATG TTAATAAGCA TATCTTGAAT GAAAGTCAGC AAAAATATGT 4080 

GCGAAACACC TGAAAAAAAT CAAAAATTCT GCGAAAATTG AAAAAATGCA TTAAAATACA 4140 

TTTTTGCATT TTTCTACATC ACATGAATGT AGAAAATTAA AAGQGAAATC AAAATTTCTA 4200 

GAGGATATAA TTGAATGAAA CATTGCGAAA TTAAAATGTG CGAAACGTCA AAAAAGAGGA 4260 
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AATTTGGGTA TCAAAATCGA TCCTAAAACC AACACATTTC AGCATCCGCC AACTCTTCAT 4320 

TCACCGGATG CTCTTCTCTC GGATACAGTT CAAGTCGTAA TCGCTCATTC AGCAAAGCTT 4380 

CTGGACCAAC TCAATACATA TTCCATGAAG AGGATATGAA CTTTGTCGAT GCACCAACCA 4440 

TAAGCCGTGT TTTCGACGAG AAAACCATGT ACAGAAACTT CTCGAGTCCT CGTGGAATGT 4500 

GCCTCATCAT AAATAATGAA CACTTTGAGC AGATGCCAAC ACGGAATGGT ACCAAGGCCG 4560 

ACAAGGACAA TCTTACCAAT TTGTTCAGAT GCATGGGCTA TACGGTTATT TGCAAGGACA 4620 

ATCTGACGGG AAGGGTACGG CGAAATTATA TTACCCAAAC GCGAAATTTG CCATTTTGCG 4680 

CCGAAAATGT GGCGCCCGGT CTCGACACGA CAATTTGTGT TAAATGCAAA AATGTATAAT 4740 

TTTGCAAAAA ACAAAATTTT GAACTTCCGC GAAAATGATT TACCTAGTTT CGAAATTTTC 4800 

GTTTTTTCCG GCTACATTAT GTGTTTTTTC TTAGTTTTTC TATAATATTT GATGTAAAAA 4860 

ACCGTTTGTA AATTTTCAGA CAATTTTCCG CATACAAAAC TTGATAGCAC GAAATCAATT 4920 

TTCTGAATTT TCAAAATTAT CCAAAAATGC ACAATTTAAA ATTTGTGAAA ATTGGCAAAC 4980 

GGTGTTTCAA TA TGAAATGT ATTTTTAAAA ACTTTAAAAA CCACTCCGGA AAAGCAATAA 5040 

AAATCAAAAC AACGTCACAA TTCAAATTCA AAAGTTATTC ATCCGATTTG TTTATTTTTG 5100 

CAAAATTTGA AAAAATCATG AAGGATTTAG AAAAGTTTTA TAACATTTTT TCTAGATTTT 5160 

TCAAAATTTT TTTTAACAAA TCGAGAAAAA GAGAATGAAA AATCGATTTT AAAAATATCC 5220 

ACAGCTTCGA GAGTTTGAAA TTACAGTACT CCTTAAAGGC GCACACCCCA TTTGCATTGG 5280 

ACCAAAAATT TGTCGTGTCG AGACCAGGTA CCGTAGTTTT TGTCGCAAAA ATTGCACCAT 5340 

TGGACAATAA ACCTTCCTAA TCACCAAAAA GTAAAATTGA AATCTTCGAA AAGCCAAAAA 5400 

ATTCAAAAAA AAAGTCGAAT TTCGATTTTT TTTTTGGTTT TTTGGTCCCA AAAACCAAAA 5460 

AAATCAATTT TCTGCAAAAT ACCAAAAAGA AACCCGAAAA AATTTCCCAG CCTTGTTCCT 5520 

AATGTAAACT GATATTTAAT TTCCAGGGAA TGCTCCTGAC AATTCGAGAC TTTGCCAAAC 5580 

ACGAATCACA CGGAGATTCT GCGATACTCG TGATTCTATC ACACGGAGAA GAGAATGTGA 5640 

TTATTGGAGT TGATGATATA CCGATTAGTA CACACGAGAT ATATGATCTT CTCAACGCGG 5700 

CAAATGCTCC CCGTCTGGCG AATAAGCCGA AAATCGTTTT TGTGCAGGCT TGTCGAGGCG 5760 

GTTCGTTTTT TATTTTAATT TTAATATAAA TATTTTAAAT AAATTCATTT TCAGAACGTC 5820 

GTGACAATGG ATTCCCAGTC TTGGATTCTG TCGACGGAGT TCCTGCATTT CTTCGTCGTG 5880 

GATGGGACAA TCGAGACGGG CCATTGTTCA ATTTTCTTGG ATGTGTGCGG CCGCAAGTTC 5940 

AGGTTGCAAT TTAATTTCTT GAATGAGAAT ATTCCTTCAA AAAATCTAAA ATAGATTTTT 6000 

ATTCCAGAAA GTCCCGATCG AAAAATTGCG ATATAATTAC GAAATTTGTG ATAAAATGAC 6060 

AAACCAATCA GCATCGTCGA TCTCCGCCCA CTTCATCGGA TTGGTTTGAA AGTGGGCGGA 6120 

GTGAATTGCT GATTGGTCGC AGTTTTCAGT TTAGAGGGAA TTTAAAAATC GCCTTTTCGA 6180 
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(2) INFORMATION FOR SEQ ID NO: 19: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

Met Met Arg Gin Asp Arg Arg Ser Leu Leu Glu Arg Asn lie Met Met 
15 10 15 
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Phe Ser Ser His Leu Lys Val Asp Glu He Leu Glu Val Leu He Ala 
20 25 30 



Lys Gin Val Leu Asn Ser Asp Asn Gly Asp Met He Asn Ser Cys Gly 
35 40 45 

Thr Val Arg Glu Lys Arg Arg Glu He Val Lys Ala Val Gin Arg Arg 
50 55 60 

Gly Asp Val Ala Phe Asp Ala Phe Tyr Asp Ala Leu Arg Ser Thr Gly 
65 70 75 80 

His Glu Gly Leu Ala Glu Val Leu Glu Pro Leu Ala Arg Ser Val Asp 
85 90 95 

Ser Asn Ala Val Glu Phe Glu Cys Pro Met Ser Pro Ala Ser His Arg 
100 105 110 

Arg Ser Arg Ala Leu Ser Pro Ala Gly Tyr Thr Ser Pro Thr Arg Val 
115 120 125 

His Arg Asp Ser Val Ser Ser Val Ser Ser Phe Thr Ser Tyr Gin Asp 
130 135 140 

He Tyr Ser Arg Ala Arg Ser Arg Ser Arg Ser Arg Ala Leu His Ser 
145 150 155 160 

Ser Asp Arg His Asn Tyr Ser Ser Pro Pro Val Asn Ala Phe Pro Ser 
165 170 175 

Gin Pro Ser Ser Ala Asn Ser Ser Phe Thr Gly Cys Ser Ser Leu Gly 
180 185 190 

Tyr Ser Ser Ser Arg Asn Arg Ser Phe Ser Lys Ala Ser Gly Pro Thr 
195 200 205 

Gin Tyr He Phe His Glu Glu Asp Met Asn Phe Val Asp Ala Pro Thr 
210 215 220 

He Ser Arg Val Phe Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser 
225 230 235 240 

Pro Arg Gly Met Cys Leu He He Asn Asn Glu His Phe Glu Gin Met 
245 250 255 

Pro Thr Arg Asn Gly Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn Leu 
260. 265 270 

Phe Arg Cys Met Gly Tyr Thr Val He Cys Lys Asp Asn Leu Thr Gly 
275 280 285 

Arg Gly Met Leu Leu Thr He Arg Asp Phe Ala Lys His Glu Ser His 
290 295 300 

Gly Asp Ser Ala He Leu Val He Leu Ser His Gly Glu Glu Asn Val 
305 310 315 320 

He He Gly Val Asp Asp He Pro He Ser Thr His Glu He Tyr Asp 
325 330 335 

Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys He 
340 345 350 

Val Phe Val Gin Ala Cys Arg Gly Glu Arg Arg Asp Asn Gly Phe Pro 



355 360 365 

Val Leu Asp Ser Val Asp Gly Val Pro Ala Phe Leu Arg Arg Gly Trp 
370 375 380 

Asp Asn Arg Asp Gly Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro 
385 390 395 400 

Gin Val Gin Gin Val Trp Arg Lys Lys Pro Ser Gin Ala Asp lie Leu 
405 410 415 

lie Arg Tyr Ala Thr Thr Ala Gin Tyr Val Ser Trp Arg Asn Ser Ala 
420 425 430 

Arg Gly Ser Trp Phe lie Gin Ala Val Cys Glu Val Phe Ser Thr His 
435 440 445 

Ala Lys Asp Met Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys 
450 455 460 

Val Ala Cys Gly Phe Gin Thr Ser Gin Gly Ser Asn lie Leu Lys Gin 
465 470 475 480 

Met Pro Glu Met Thr Ser Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro 
485 490 495 

Glu Ala Arg Asn Ser Ala Val 
500 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 505 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

Met Met Arg Gin Asp Arg Trp Leu Leu Glu Arg Asn lie Leu Glu Phe 
1 5 10 15 

Ser Ser Lys Leu Gin Ala Asp Leu lie Leu Asp Val Leu lie Ala Lys 
20 25 30 

Gin Val Leu Asn Ser Asp Asn Gly Asp Val He Asn Ser Cys Arg Thr 
35 40 45 

Glu Arg Asp Asn Glu Lys Glu He Val Lys Ala Val Gin Arg Arg Gly 
50 55 60 

Asp Glu Ala Phe Asp Ala Phe Tyr Asp Ala Leu Arg Asp Thr Gly His 
65 70 75 80 

Asn Asp Leu Ala Asp Val Leu Met Pro Leu Ser Arg Pro Xaa Xaa Xaa 
85 90 95 

Asn Pro Val Pro Met Glu Cys Pro Met Ser Pro Ser Ser His Arg Arg 
100 105 110 

Ser Arg Ala Leu Ser Pro Pro Xaa Tyr Ala Ser Pro Thr Arg Val His 
115 120 125 



Asp Ser He Ser Ser Val Ser Ser Phe Thr Ser Thr Tyr Gin Asp 



Ser Ser Asp Arg His Asn Tyr Met Ser Ala Ala Thr Ser Phe Pro Ser 
165 170 175 

Gin Pro Xaa Ser Ala Asn Ser Ser Phe Thr Gly Cys Ala Ser Leu Gly 
180 185 190 

Tyr Ser Ser Ser Arg Asn Arg Ser Phe Ser Lys Thr Ser Ala Gin Ser 
195 200 205 

Gin Tyr He Phe His Glu Glu Asp Met Asn Tyr Val Asp Ala Pro Thr 
210 215 220 

He His Arg Val Phe Asp Glu Lys Thr Met Tyr Arg Asn Phe Ser Ser 
225 230 235 240 

Pro Arg Gly Leu Cys Leu He He Asn Asn Glu His Phe Glu Gin Met 
245 250 255 

Pro Thr Arg Asn Gly Thr Lys Ala Asp Lys Asp Asn Leu Thr Asn He 
260 265 270 

Phe Arg Cys Met Gly Tyr Thr Val He Cys Lys Asp Asn Leu Thr Gly 
275 280 285 

Arg Glu Met Leu Ser Thr He Arg Ser Phe Gly Arg Asn Asp Met His 
290 295 300 

Gly Asp Ser Ala He Leu Val He Leu Ser His Gly Glu Xaa Asn Val 
305 310 315 320 

He He Gly Val Asp Asp Val Ser Val Asn Val His Glu He Tyr Asp 
325 330 335 

Leu Leu Asn Ala Ala Asn Ala Pro Arg Leu Ala Asn Lys Pro Lys Leu 
340 345 350 

Val Phe Val Gin Ala Cys Arg Gly Glu Arg Arg Asp Asn Gly Phe Pro 
355 360 365 

Val Leu Asp Ser Val Asp Gly Val Pro Ser Leu He Arg Arg Gly Trp 
370 375 380 

Asp Asn Arg Asp Gly Pro Leu Phe Asn Phe Leu Gly Cys Val Arg Pro 
385 390 395 400 

Gin Val Gin Gin Val Trp Arg Lys Lys Pro Ser Gin Ala Asp Met Leu 
405 410 415 

He Ala Tyr Ala Thr Thr Ala Gin Tyr Val Ser Trp Arg Asn Ser Ala 
420 425 430 

Arg Gly Ser Trp Phe He Gin Ala Val Cys Glu Val Phe Ser Leu His 
435 440 445 

Ala Lys Asp Met Asp Val Val Glu Leu Leu Thr Glu Val Asn Lys Lys 
450 455 460 

Val Ala Cys Gly Phe Gin Thr Ser Gin Gly Ser Asn He Leu Lys Gin 



130 



135 



140 



Val Tyr Ser Arg Ala Arg Ser Ser Ser Arg Ser Ser Arg Pro Leu Gin 
145 150 155 160 




m 



465 



470 



475 



480 



Met Pro Glu Leu Thr Ser Arg Leu Leu Lys Lys Phe Tyr Phe Trp Pro 



INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 479 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : not relevant 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

Thr Val Ser lie Ser Leu He He Ala Arg Gin Val Leu Asn Ser Asp 
! 5 10 15 

Asn Gly Asp Met He Asn Ser Cys Gly Thr Val Arg Glu Lys Arg Arg 
20 25 30 

Glu He Val Lys Ala Val Gin Arg Arg Gly Asp Val Ala Phe Asp Ala 
35 40 45 

Phe Tyr Asp Ala Leu Arg Ser Thr Gly His Glu Gly Leu Ala Glu Val 
50 55 60 

Leu Glu Pro Leu Ala Arg Ser Val Asp Ser Asn Ala Val Glu Phe Glu 
65 70 75 80 

Cys Pro Met Ser Pro Ala Ser His Arg Arg Ser Arg Ala Leu Ser Pro 
85 90 95 

Ala Gly Tyr Thr Ser Pro Thr Arg Val His Arg Asp Ser Val Ser Ser 
100 105 110 

Val Ser Ser Phe Thr Ser Tyr Gin Asp He Tyr Ser Arg Ser Thr Ser 
115 120 125 

Ser Ser Ser Arg Pro Leu His Thr Ser Asp Arg His Asn Tyr Val Ser 
130 135 140 

Pro Ser Asn Ser Phe Gin Ser Gin Pro Ala Ser Ala Asn Ser Ser Phe 
145 150 155 160 

Thr Gly Ser Ser Ser Leu Gly Tyr Ser Ser Ser Arg Thr Arg Ser Tyr 
165 170 175 

Ser Lys Ala Ser Ala His Ser Gin Tyr He His Glu Glu Asp Met Asn 
180 185 190 

Tyr Val Asp Ala Pro Thr He His Arg Val Phe Asp Glu Lys Thr Met 
195 200 205 

Tyr Arg Asn Phe Ser Thr Pro Arg Gly Leu Cys Leu He He Asn Asn 
210 215 220 

Glu His Phe Glu Gin Met Pro Thr Arg Asn Gly Thr Lys Pro Asp Lys 
225 230 235 240 



485 



490 



495 



Glu Asp Arg Gly Arg Asn Ser Ala Val 
500 505 




Asp Asn He Ser Asn Leu Phe Arg Cys Met Gly Tyr He Val His Cys 
245 250 255 

Lys Asp Asn Leu Thr Gly Arg Gly Met Met Leu Thr He Arg Asp Phe 
260 265 270 

Ala Lys Asn Glu Thr His Gly Asp Ser Ala He Leu Val He Leu Ser 
275 280 285 

His Gly Glu Glu Asn Val He He Gly Val Asp Asp Val Ser Val Asn 
290 295 300 

Val His Glu He Tyr Xaa Leu Leu Asn Ala Ala Ash Ala Pro Arg Leu 
305 310 315 320 

Ala Asn Lys Pro Lys Leu Val Phe Val Gin Ala Cys Arg Gly Glu Arg 
325 330 335 

Arg Asp Val Gly Phe Pro Val Leu Asp Ser Val Asp Gly Val Pro Ala 
340 345 350 

Leu He Arg Arg Gly Trp Asp Lys Gly Asp Gly Pro Leu Phe Asn Phe 
355 360 365 

Leu Gly Cys Val Arg Pro Gin Ala Gin Gin Val Trp Arg Lys Lys Pro 
370 375 380 

Ser Gin Ala Asp He Leu He Ala Tyr Ala Thr Thr Ala Gin Tyr Val 
385 390 395 400 

Ser Trp Arg Asn Ser Ala Arg Gly Ser Trp Phe He Gin Ala Val Cys 
405 410 415 

Glu Val Phe Ser Leu His Ala Lys Asp Met Asp Val Val Glu Leu Leu 
420 425 430 

Thr Glu Val Asn Lys Lys Val Ala Cys Gly Phe Gin Thr Ser Gin Gly 
435 440 445 

Ala Asn He Leu Lys Gin Met Pro Glu Leu Thr Ser Arg Leu Leu Lys 
450 455 460 

Lys Phe Tyr Phe Trp Pro Glu Asp Arg Asn Arg Ser Ser Ala Val 
465 470 475 

(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATTGGCGATC CTCTCGA 
(2) INFORMATION FOR SEQ ID NO:23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDED NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCATCGACTT TTAGATGACT AGAGAACATC 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D ) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
GTTGCACTGC TTTCACGATC TCCCGTCTCT 
(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GTTTAATTAC CCAAGTTTGA G 
(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GGTTTTAACC AGTTACTCAA G 
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
CCGGTGACAT TGGACACTC 



fit 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

( B ) TYPE : nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28 



ACTATTCAAC ACTTG 



